Dataset statistics
| Number of variables | 20 |
|---|---|
| Number of observations | 5000 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 781.4 KiB |
| Average record size in memory | 160.0 B |
Variable types
| Categorical | 3 |
|---|---|
| Boolean | 5 |
| Numeric | 12 |
Salary is highly correlated with Base_pay and 6 other fields | High correlation |
Base_pay is highly correlated with Salary and 6 other fields | High correlation |
Bonus is highly correlated with Salary and 6 other fields | High correlation |
Unit_Price is highly correlated with Salary and 6 other fields | High correlation |
low is highly correlated with Salary and 6 other fields | High correlation |
Unit_Sales is highly correlated with Salary and 6 other fields | High correlation |
Total_Sales is highly correlated with Salary and 6 other fields | High correlation |
Months is highly correlated with Salary and 6 other fields | High correlation |
Salary is highly correlated with Base_pay and 6 other fields | High correlation |
Base_pay is highly correlated with Salary and 6 other fields | High correlation |
Bonus is highly correlated with Salary and 6 other fields | High correlation |
Unit_Price is highly correlated with Salary and 5 other fields | High correlation |
low is highly correlated with Salary and 6 other fields | High correlation |
Unit_Sales is highly correlated with Salary and 5 other fields | High correlation |
Total_Sales is highly correlated with Salary and 6 other fields | High correlation |
Months is highly correlated with Salary and 4 other fields | High correlation |
Salary is highly correlated with Base_pay and 6 other fields | High correlation |
Base_pay is highly correlated with Salary and 6 other fields | High correlation |
Bonus is highly correlated with Salary and 6 other fields | High correlation |
Unit_Price is highly correlated with Salary and 5 other fields | High correlation |
low is highly correlated with Salary and 6 other fields | High correlation |
Unit_Sales is highly correlated with Salary and 6 other fields | High correlation |
Total_Sales is highly correlated with Salary and 6 other fields | High correlation |
Months is highly correlated with Salary and 5 other fields | High correlation |
Age is highly correlated with Salary and 5 other fields | High correlation |
Salary is highly correlated with Age and 9 other fields | High correlation |
Base_pay is highly correlated with Age and 9 other fields | High correlation |
Bonus is highly correlated with Age and 9 other fields | High correlation |
Unit_Price is highly correlated with Salary and 7 other fields | High correlation |
openingbalance is highly correlated with closingbalance and 2 other fields | High correlation |
closingbalance is highly correlated with Salary and 8 other fields | High correlation |
low is highly correlated with Salary and 7 other fields | High correlation |
Unit_Sales is highly correlated with Age and 9 other fields | High correlation |
Total_Sales is highly correlated with Age and 8 other fields | High correlation |
Months is highly correlated with Age and 9 other fields | High correlation |
Education is highly correlated with Salary and 2 other fields | High correlation |
Salary has unique values | Unique |
Bonus has unique values | Unique |
Reproduction
| Analysis started | 2021-12-05 14:29:57.794011 |
|---|---|
| Analysis finished | 2021-12-05 14:31:58.634540 |
| Duration | 2 minutes and 0.84 seconds |
| Software version | pandas-profiling v3.1.0 |
| Download configuration | config.json |
Gender
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 39.2 KiB |
| Male | |
|---|---|
| Female |
Length
| Max length | 6 |
|---|---|
| Median length | 4 |
| Mean length | 4.9888 |
| Min length | 4 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 ? |
| Distinct scripts | 0 ? |
| Distinct blocks | 0 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Female |
|---|---|
| 2nd row | Female |
| 3rd row | Male |
| 4th row | Female |
| 5th row | Male |
Common Values
| Value | Count | Frequency (%) |
| Male | 2528 | |
| Female | 2472 |
Length
Pie chart
| Value | Count | Frequency (%) |
| male | 2528 | |
| female | 2472 |
Most occurring characters
| Value | Count | Frequency (%) |
| No values found. | ||
Most occurring categories
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per category
Most occurring scripts
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per script
Most occurring blocks
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per block
Business
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 5.0 KiB |
| False | |
|---|---|
| True |
| Value | Count | Frequency (%) |
| False | 4200 | |
| True | 800 | 16.0% |
Dependancies
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 5.0 KiB |
| False | |
|---|---|
| True |
| Value | Count | Frequency (%) |
| False | 3524 | |
| True | 1476 |
Calls
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 5.0 KiB |
| True | |
|---|---|
| False |
| Value | Count | Frequency (%) |
| True | 4539 | |
| False | 461 | 9.2% |
Type
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 39.2 KiB |
| Month-to-month | |
|---|---|
| Two year | |
| One year |
Length
| Max length | 14 |
|---|---|
| Median length | 14 |
| Mean length | 11.3324 |
| Min length | 8 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 ? |
| Distinct scripts | 0 ? |
| Distinct blocks | 0 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Month-to-month |
|---|---|
| 2nd row | Month-to-month |
| 3rd row | Month-to-month |
| 4th row | Month-to-month |
| 5th row | Month-to-month |
Common Values
| Value | Count | Frequency (%) |
| Month-to-month | 2777 | |
| Two year | 1195 | |
| One year | 1028 | 20.6% |
Length
Pie chart
| Value | Count | Frequency (%) |
| month-to-month | 2777 | |
| year | 2223 | |
| two | 1195 | |
| one | 1028 | 14.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| No values found. | ||
Most occurring categories
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per category
Most occurring scripts
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per script
Most occurring blocks
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per block
Billing
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 5.0 KiB |
| True | |
|---|---|
| False |
| Value | Count | Frequency (%) |
| True | 2956 | |
| False | 2044 |
Rating
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 5.0 KiB |
| False | |
|---|---|
| True |
| Value | Count | Frequency (%) |
| False | 3682 | |
| True | 1318 | 26.4% |
| Distinct | 65 |
|---|---|
| Distinct (%) | 1.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 51.865 |
| Minimum | 18 |
|---|---|
| Maximum | 88 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 39.2 KiB |
Quantile statistics
| Minimum | 18 |
|---|---|
| 5-th percentile | 38 |
| Q1 | 47 |
| median | 52 |
| Q3 | 57 |
| 95-th percentile | 65 |
| Maximum | 88 |
| Range | 70 |
| Interquartile range (IQR) | 10 |
Descriptive statistics
| Standard deviation | 8.560691099 |
|---|---|
| Coefficient of variation (CV) | 0.1650571888 |
| Kurtosis | 0.8620040667 |
| Mean | 51.865 |
| Median Absolute Deviation (MAD) | 5 |
| Skewness | -0.2599714625 |
| Sum | 259325 |
| Variance | 73.28543209 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 50 | 256 | 5.1% |
| 53 | 254 | 5.1% |
| 55 | 248 | 5.0% |
| 54 | 245 | 4.9% |
| 51 | 244 | 4.9% |
| 56 | 238 | 4.8% |
| 52 | 234 | 4.7% |
| 49 | 222 | 4.4% |
| 47 | 204 | 4.1% |
| 57 | 200 | 4.0% |
| Other values (55) | 2655 |
| Value | Count | Frequency (%) |
| 18 | 2 | < 0.1% |
| 19 | 3 | 0.1% |
| 20 | 4 | |
| 21 | 8 | |
| 22 | 6 | |
| 23 | 7 | |
| 24 | 5 | |
| 25 | 4 | |
| 26 | 3 | 0.1% |
| 27 | 3 | 0.1% |
| Value | Count | Frequency (%) |
| 88 | 2 | < 0.1% |
| 85 | 1 | < 0.1% |
| 82 | 1 | < 0.1% |
| 80 | 1 | < 0.1% |
| 79 | 1 | < 0.1% |
| 78 | 2 | < 0.1% |
| 76 | 5 | |
| 75 | 6 | |
| 74 | 9 | |
| 73 | 10 |
| Distinct | 5000 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 99821.92855 |
| Minimum | 5089 |
|---|---|
| Maximum | 199970.74 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 39.2 KiB |
Quantile statistics
| Minimum | 5089 |
|---|---|
| 5-th percentile | 57978.14651 |
| Q1 | 83890.33898 |
| median | 100579.3785 |
| Q3 | 116912.0925 |
| 95-th percentile | 139003.3586 |
| Maximum | 199970.74 |
| Range | 194881.74 |
| Interquartile range (IQR) | 33021.75349 |
Descriptive statistics
| Standard deviation | 25376.96174 |
|---|---|
| Coefficient of variation (CV) | 0.2542223148 |
| Kurtosis | 0.9593172698 |
| Mean | 99821.92855 |
| Median Absolute Deviation (MAD) | 16440.20885 |
| Skewness | -0.3960416 |
| Sum | 499109642.8 |
| Variance | 643990187.4 |
| Monotonicity | Strictly increasing |
| Value | Count | Frequency (%) |
| 136944.7369 | 1 | < 0.1% |
| 80831.25296 | 1 | < 0.1% |
| 87070.84105 | 1 | < 0.1% |
| 100185.4335 | 1 | < 0.1% |
| 92234.2413 | 1 | < 0.1% |
| 115437.0058 | 1 | < 0.1% |
| 112634.7853 | 1 | < 0.1% |
| 75590.50714 | 1 | < 0.1% |
| 134402.6776 | 1 | < 0.1% |
| 104797.2382 | 1 | < 0.1% |
| Other values (4990) | 4990 |
| Value | Count | Frequency (%) |
| 5089 | 1 | |
| 5698.12 | 1 | |
| 5896.65 | 1 | |
| 6125.12 | 1 | |
| 6245 | 1 | |
| 6444.23 | 1 | |
| 6455.5 | 1 | |
| 6458.35722 | 1 | |
| 6529.23 | 1 | |
| 6682.33 | 1 |
| Value | Count | Frequency (%) |
| 199970.74 | 1 | |
| 195970.7 | 1 | |
| 192636.8 | 1 | |
| 185685.9 | 1 | |
| 180696.8 | 1 | |
| 175689.3 | 1 | |
| 170639.5565 | 1 | |
| 170372.5473 | 1 | |
| 169149.707 | 1 | |
| 168974.528 | 1 |
| Distinct | 4884 |
|---|---|
| Distinct (%) | 97.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 40046.18771 |
| Minimum | 2035.6 |
|---|---|
| Maximum | 79988.296 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 39.2 KiB |
Quantile statistics
| Minimum | 2035.6 |
|---|---|
| 5-th percentile | 23662.82554 |
| Q1 | 33744.02163 |
| median | 40231.75141 |
| Q3 | 46764.83697 |
| 95-th percentile | 55855.84708 |
| Maximum | 79988.296 |
| Range | 77952.696 |
| Interquartile range (IQR) | 13020.81534 |
Descriptive statistics
| Standard deviation | 10112.34245 |
|---|---|
| Coefficient of variation (CV) | 0.2525169818 |
| Kurtosis | 1.083725523 |
| Mean | 40046.18771 |
| Median Absolute Deviation (MAD) | 6508.79148 |
| Skewness | -0.365034841 |
| Sum | 200230938.5 |
| Variance | 102259469.9 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 40046.18771 | 23 | 0.5% |
| 53351.51902 | 21 | 0.4% |
| 56174.47734 | 16 | 0.3% |
| 61235.51239 | 12 | 0.2% |
| 60807.2653 | 11 | 0.2% |
| 72278.72 | 8 | 0.2% |
| 54310.97997 | 8 | 0.2% |
| 57671.40939 | 6 | 0.1% |
| 50544.82434 | 5 | 0.1% |
| 54699.29426 | 5 | 0.1% |
| Other values (4874) | 4885 |
| Value | Count | Frequency (%) |
| 2035.6 | 1 | |
| 2279.248 | 1 | |
| 2358.66 | 1 | |
| 2450.048 | 1 | |
| 2498 | 1 | |
| 2577.692 | 1 | |
| 2582.2 | 1 | |
| 2583.342888 | 1 | |
| 2611.692 | 1 | |
| 2672.932 | 1 |
| Value | Count | Frequency (%) |
| 79988.296 | 1 | < 0.1% |
| 78388.28 | 1 | < 0.1% |
| 77054.72 | 1 | < 0.1% |
| 74274.36 | 1 | < 0.1% |
| 72278.72 | 8 | |
| 70275.72 | 1 | < 0.1% |
| 68255.82259 | 1 | < 0.1% |
| 68149.01893 | 1 | < 0.1% |
| 67659.8828 | 1 | < 0.1% |
| 66052.99437 | 1 | < 0.1% |
| Distinct | 5000 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4991.096428 |
| Minimum | 254.45 |
|---|---|
| Maximum | 9998.537 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 39.2 KiB |
Quantile statistics
| Minimum | 254.45 |
|---|---|
| 5-th percentile | 2898.907326 |
| Q1 | 4194.51695 |
| median | 5028.968925 |
| Q3 | 5845.604624 |
| 95-th percentile | 6950.16793 |
| Maximum | 9998.537 |
| Range | 9744.087 |
| Interquartile range (IQR) | 1651.087674 |
Descriptive statistics
| Standard deviation | 1268.848087 |
|---|---|
| Coefficient of variation (CV) | 0.2542223148 |
| Kurtosis | 0.9593172706 |
| Mean | 4991.096428 |
| Median Absolute Deviation (MAD) | 822.0104425 |
| Skewness | -0.3960416001 |
| Sum | 24955482.14 |
| Variance | 1609975.468 |
| Monotonicity | Strictly increasing |
| Value | Count | Frequency (%) |
| 5728.02081 | 1 | < 0.1% |
| 5801.83736 | 1 | < 0.1% |
| 4299.486801 | 1 | < 0.1% |
| 4868.720636 | 1 | < 0.1% |
| 4569.408366 | 1 | < 0.1% |
| 4524.397931 | 1 | < 0.1% |
| 4987.184195 | 1 | < 0.1% |
| 5874.07095 | 1 | < 0.1% |
| 4600.944255 | 1 | < 0.1% |
| 7313.18372 | 1 | < 0.1% |
| Other values (4990) | 4990 |
| Value | Count | Frequency (%) |
| 254.45 | 1 | |
| 284.906 | 1 | |
| 294.8325 | 1 | |
| 306.256 | 1 | |
| 312.25 | 1 | |
| 322.2115 | 1 | |
| 322.775 | 1 | |
| 322.917861 | 1 | |
| 326.4615 | 1 | |
| 334.1165 | 1 |
| Value | Count | Frequency (%) |
| 9998.537 | 1 | |
| 9798.535 | 1 | |
| 9631.84 | 1 | |
| 9284.295 | 1 | |
| 9034.84 | 1 | |
| 8784.465 | 1 | |
| 8531.977825 | 1 | |
| 8518.627365 | 1 | |
| 8457.48535 | 1 | |
| 8448.7264 | 1 |
| Distinct | 3836 |
|---|---|
| Distinct (%) | 76.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 51.25852201 |
| Minimum | 1.44 |
|---|---|
| Maximum | 629.511067 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 39.2 KiB |
Quantile statistics
| Minimum | 1.44 |
|---|---|
| 5-th percentile | 12.94799715 |
| Q1 | 25.72749975 |
| median | 39.205 |
| Q3 | 58.71500025 |
| 95-th percentile | 124.5604981 |
| Maximum | 629.511067 |
| Range | 628.071067 |
| Interquartile range (IQR) | 32.9875005 |
Descriptive statistics
| Standard deviation | 52.24402237 |
|---|---|
| Coefficient of variation (CV) | 1.019226078 |
| Kurtosis | 53.81530851 |
| Mean | 51.25852201 |
| Median Absolute Deviation (MAD) | 15.455 |
| Skewness | 5.989663043 |
| Sum | 256292.6101 |
| Variance | 2729.437874 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 26 | 7 | 0.1% |
| 42.200001 | 6 | 0.1% |
| 20.809999 | 5 | 0.1% |
| 20.059999 | 5 | 0.1% |
| 17 | 5 | 0.1% |
| 37.290001 | 5 | 0.1% |
| 21.620001 | 5 | 0.1% |
| 48.490002 | 5 | 0.1% |
| 22.219999 | 5 | 0.1% |
| 14.38 | 5 | 0.1% |
| Other values (3826) | 4947 |
| Value | Count | Frequency (%) |
| 1.44 | 1 | |
| 1.47 | 1 | |
| 1.51 | 1 | |
| 1.52 | 1 | |
| 1.6 | 1 | |
| 1.61 | 1 | |
| 1.62 | 1 | |
| 1.63 | 1 | |
| 1.65 | 1 | |
| 1.66 | 1 |
| Value | Count | Frequency (%) |
| 629.511067 | 1 | |
| 629.510005 | 1 | |
| 627.841071 | 1 | |
| 627.839984 | 1 | |
| 625.861078 | 1 | |
| 625.860033 | 1 | |
| 610.001045 | 1 | |
| 609.999993 | 1 | |
| 604.46106 | 1 | |
| 604.460006 | 1 |
Volume
Real number (ℝ≥0)
| Distinct | 4831 |
|---|---|
| Distinct (%) | 96.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 6761260.34 |
| Minimum | 0 |
|---|---|
| Maximum | 320868400 |
| Zeros | 1 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 39.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 453495 |
| Q1 | 1283850 |
| median | 2870600 |
| Q3 | 6247100 |
| 95-th percentile | 22752030 |
| Maximum | 320868400 |
| Range | 320868400 |
| Interquartile range (IQR) | 4963250 |
Descriptive statistics
| Standard deviation | 16204756.36 |
|---|---|
| Coefficient of variation (CV) | 2.396706464 |
| Kurtosis | 100.1371799 |
| Mean | 6761260.34 |
| Median Absolute Deviation (MAD) | 1969750 |
| Skewness | 8.709735196 |
| Sum | 3.38063017 × 1010 |
| Variance | 2.625941288 × 1014 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 779500 | 4 | 0.1% |
| 548000 | 4 | 0.1% |
| 1443400 | 3 | 0.1% |
| 1346400 | 3 | 0.1% |
| 3337500 | 2 | < 0.1% |
| 2186900 | 2 | < 0.1% |
| 411200 | 2 | < 0.1% |
| 2105000 | 2 | < 0.1% |
| 5975500 | 2 | < 0.1% |
| 1932400 | 2 | < 0.1% |
| Other values (4821) | 4974 |
| Value | Count | Frequency (%) |
| 0 | 1 | |
| 3700 | 1 | |
| 5100 | 1 | |
| 9800 | 1 | |
| 10000 | 1 | |
| 19400 | 1 | |
| 32200 | 1 | |
| 45500 | 1 | |
| 95100 | 1 | |
| 97300 | 1 |
| Value | Count | Frequency (%) |
| 320868400 | 1 | |
| 223486800 | 1 | |
| 220104700 | 1 | |
| 215620200 | 1 | |
| 209521300 | 1 | |
| 205257900 | 1 | |
| 200070600 | 1 | |
| 198078600 | 1 | |
| 195117100 | 1 | |
| 192609900 | 1 |
| Distinct | 2986 |
|---|---|
| Distinct (%) | 59.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 40.73326344 |
| Minimum | 3.68 |
|---|---|
| Maximum | 313.9039044 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 39.2 KiB |
Quantile statistics
| Minimum | 3.68 |
|---|---|
| 5-th percentile | 12.32789947 |
| Q1 | 26.39763289 |
| median | 33.119999 |
| Q3 | 42.52500025 |
| 95-th percentile | 109.0624981 |
| Maximum | 313.9039044 |
| Range | 310.2239044 |
| Interquartile range (IQR) | 16.12736736 |
Descriptive statistics
| Standard deviation | 32.57885267 |
|---|---|
| Coefficient of variation (CV) | 0.7998095395 |
| Kurtosis | 24.51621409 |
| Mean | 40.73326344 |
| Median Absolute Deviation (MAD) | 7.49049875 |
| Skewness | 4.094790645 |
| Sum | 203666.3172 |
| Variance | 1061.381641 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 33.119999 | 1479 | |
| 31.6 | 5 | 0.1% |
| 20.959999 | 5 | 0.1% |
| 26.530001 | 4 | 0.1% |
| 43.889999 | 4 | 0.1% |
| 24.540001 | 4 | 0.1% |
| 32.290001 | 4 | 0.1% |
| 54.029999 | 4 | 0.1% |
| 38.099998 | 4 | 0.1% |
| 31.99 | 4 | 0.1% |
| Other values (2976) | 3483 |
| Value | Count | Frequency (%) |
| 3.68 | 1 | |
| 3.71 | 1 | |
| 3.75 | 1 | |
| 3.85 | 1 | |
| 4.21 | 1 | |
| 4.23 | 1 | |
| 4.26 | 1 | |
| 4.31 | 1 | |
| 4.4 | 1 | |
| 4.79 | 1 |
| Value | Count | Frequency (%) |
| 313.9039044 | 1 | |
| 313.7887918 | 1 | |
| 313.2432598 | 1 | |
| 310.720001 | 1 | |
| 310.670013 | 1 | |
| 309.369995 | 1 | |
| 308.959991 | 1 | |
| 308.899994 | 1 | |
| 308.01001 | 1 | |
| 307.200012 | 1 |
| Distinct | 4011 |
|---|---|
| Distinct (%) | 80.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 43.57782761 |
| Minimum | 3.68 |
|---|---|
| Maximum | 313.6886942 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 39.2 KiB |
Quantile statistics
| Minimum | 3.68 |
|---|---|
| 5-th percentile | 11.2195 |
| Q1 | 21.99 |
| median | 33.34 |
| Q3 | 51.1175005 |
| 95-th percentile | 122.3109982 |
| Maximum | 313.6886942 |
| Range | 310.0086942 |
| Interquartile range (IQR) | 29.1275005 |
Descriptive statistics
| Standard deviation | 37.14851225 |
|---|---|
| Coefficient of variation (CV) | 0.8524636102 |
| Kurtosis | 15.83527458 |
| Mean | 43.57782761 |
| Median Absolute Deviation (MAD) | 13.85711096 |
| Skewness | 3.22636805 |
| Sum | 217889.1381 |
| Variance | 1380.011962 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 37.919998 | 7 | 0.1% |
| 35 | 7 | 0.1% |
| 18.4 | 6 | 0.1% |
| 22.27 | 6 | 0.1% |
| 43.66 | 5 | 0.1% |
| 33.25 | 5 | 0.1% |
| 33.34 | 5 | 0.1% |
| 38.93 | 5 | 0.1% |
| 53.25 | 5 | 0.1% |
| 25 | 5 | 0.1% |
| Other values (4001) | 4944 |
| Value | Count | Frequency (%) |
| 3.68 | 1 | |
| 3.76 | 1 | |
| 3.86 | 1 | |
| 4.21 | 1 | |
| 4.22 | 1 | |
| 4.28 | 1 | |
| 4.29 | 1 | |
| 4.31 | 1 | |
| 4.33 | 1 | |
| 4.41 | 1 |
| Value | Count | Frequency (%) |
| 313.6886942 | 1 | |
| 312.3073158 | 1 | |
| 312.2053084 | 1 | |
| 311.839996 | 1 | |
| 311.290009 | 1 | |
| 310.8304586 | 1 | |
| 310.670013 | 1 | |
| 308.959991 | 1 | |
| 308.769989 | 1 | |
| 308.579987 | 1 |
| Distinct | 4014 |
|---|---|
| Distinct (%) | 80.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 43.03412904 |
| Minimum | 3.65 |
|---|---|
| Maximum | 312.4324379 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 39.2 KiB |
Quantile statistics
| Minimum | 3.65 |
|---|---|
| 5-th percentile | 10.9885 |
| Q1 | 21.71874969 |
| median | 32.880001 |
| Q3 | 50.415 |
| 95-th percentile | 121.332002 |
| Maximum | 312.4324379 |
| Range | 308.7824379 |
| Interquartile range (IQR) | 28.69625031 |
Descriptive statistics
| Standard deviation | 36.76064128 |
|---|---|
| Coefficient of variation (CV) | 0.8542206406 |
| Kurtosis | 15.91827818 |
| Mean | 43.03412904 |
| Median Absolute Deviation (MAD) | 13.67 |
| Skewness | 3.233666708 |
| Sum | 215170.6452 |
| Variance | 1351.344747 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 32.380001 | 6 | 0.1% |
| 24.41 | 5 | 0.1% |
| 25.719999 | 5 | 0.1% |
| 35 | 5 | 0.1% |
| 33.169998 | 5 | 0.1% |
| 24.25 | 5 | 0.1% |
| 21.860001 | 4 | 0.1% |
| 25.76 | 4 | 0.1% |
| 33.5 | 4 | 0.1% |
| 45.27 | 4 | 0.1% |
| Other values (4004) | 4953 |
| Value | Count | Frequency (%) |
| 3.65 | 2 | |
| 3.72 | 1 | |
| 3.83 | 1 | |
| 4.08 | 1 | |
| 4.13 | 1 | |
| 4.15 | 1 | |
| 4.21 | 1 | |
| 4.22 | 1 | |
| 4.27 | 1 | |
| 4.66 | 1 |
| Value | Count | Frequency (%) |
| 312.4324379 | 1 | |
| 311.0810891 | 1 | |
| 310.9550008 | 1 | |
| 309.6100281 | 1 | |
| 309.420013 | 1 | |
| 308.48999 | 1 | |
| 307.399994 | 1 | |
| 305.799988 | 1 | |
| 305.459991 | 1 | |
| 305.450012 | 1 |
| Distinct | 1434 |
|---|---|
| Distinct (%) | 28.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 64.84151 |
| Minimum | 18.25 |
|---|---|
| Maximum | 118.75 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 39.2 KiB |
Quantile statistics
| Minimum | 18.25 |
|---|---|
| 5-th percentile | 19.65 |
| Q1 | 35.5 |
| median | 70.5 |
| Q3 | 89.95 |
| 95-th percentile | 107.6525 |
| Maximum | 118.75 |
| Range | 100.5 |
| Interquartile range (IQR) | 54.45 |
Descriptive statistics
| Standard deviation | 30.13967985 |
|---|---|
| Coefficient of variation (CV) | 0.4648207583 |
| Kurtosis | -1.25800149 |
| Mean | 64.84151 |
| Median Absolute Deviation (MAD) | 24.1 |
| Skewness | -0.2254472442 |
| Sum | 324207.55 |
| Variance | 908.4003015 |
| Monotonicity | Increasing |
| Value | Count | Frequency (%) |
| 19.85 | 36 | 0.7% |
| 20.05 | 35 | 0.7% |
| 19.9 | 32 | 0.6% |
| 19.7 | 32 | 0.6% |
| 20.25 | 32 | 0.6% |
| 20 | 31 | 0.6% |
| 19.55 | 30 | 0.6% |
| 20.15 | 29 | 0.6% |
| 19.8 | 29 | 0.6% |
| 19.95 | 29 | 0.6% |
| Other values (1424) | 4685 |
| Value | Count | Frequency (%) |
| 18.25 | 1 | < 0.1% |
| 18.4 | 1 | < 0.1% |
| 18.7 | 1 | < 0.1% |
| 18.75 | 1 | < 0.1% |
| 18.8 | 4 | |
| 18.85 | 4 | |
| 18.9 | 1 | < 0.1% |
| 18.95 | 5 | |
| 19 | 5 | |
| 19.05 | 7 |
| Value | Count | Frequency (%) |
| 118.75 | 1 | |
| 118.65 | 1 | |
| 118.6 | 2 | |
| 117.8 | 1 | |
| 117.6 | 1 | |
| 117.45 | 1 | |
| 117.2 | 1 | |
| 117.15 | 1 | |
| 116.85 | 1 | |
| 116.8 | 1 |
| Distinct | 4706 |
|---|---|
| Distinct (%) | 94.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2269.56846 |
| Minimum | 18.8 |
|---|---|
| Maximum | 8684.8 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 39.2 KiB |
Quantile statistics
| Minimum | 18.8 |
|---|---|
| 5-th percentile | 49.8475 |
| Q1 | 389.2125 |
| median | 1395.65 |
| Q3 | 3722.3375 |
| 95-th percentile | 6889.81 |
| Maximum | 8684.8 |
| Range | 8666 |
| Interquartile range (IQR) | 3333.125 |
Descriptive statistics
| Standard deviation | 2264.62695 |
|---|---|
| Coefficient of variation (CV) | 0.9978227094 |
| Kurtosis | -0.2065550055 |
| Mean | 2269.56846 |
| Median Absolute Deviation (MAD) | 1215.175 |
| Skewness | 0.976270175 |
| Sum | 11347842.3 |
| Variance | 5128535.222 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1395.65 | 16 | 0.3% |
| 19.75 | 6 | 0.1% |
| 20.15 | 6 | 0.1% |
| 19.45 | 6 | 0.1% |
| 20.2 | 6 | 0.1% |
| 45.3 | 5 | 0.1% |
| 20.45 | 5 | 0.1% |
| 19.55 | 5 | 0.1% |
| 20.05 | 5 | 0.1% |
| 20.25 | 5 | 0.1% |
| Other values (4696) | 4935 |
| Value | Count | Frequency (%) |
| 18.8 | 1 | < 0.1% |
| 18.85 | 1 | < 0.1% |
| 18.9 | 1 | < 0.1% |
| 19 | 1 | < 0.1% |
| 19.05 | 1 | < 0.1% |
| 19.1 | 2 | |
| 19.15 | 1 | < 0.1% |
| 19.2 | 4 | |
| 19.25 | 1 | < 0.1% |
| 19.3 | 2 |
| Value | Count | Frequency (%) |
| 8684.8 | 1 | |
| 8672.45 | 1 | |
| 8564.75 | 1 | |
| 8529.5 | 1 | |
| 8496.7 | 1 | |
| 8477.7 | 1 | |
| 8477.6 | 1 | |
| 8476.5 | 1 | |
| 8468.2 | 1 | |
| 8456.75 | 1 |
| Distinct | 73 |
|---|---|
| Distinct (%) | 1.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 32.1848 |
| Minimum | 0 |
|---|---|
| Maximum | 72 |
| Zeros | 8 |
| Zeros (%) | 0.2% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 39.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 8 |
| median | 28 |
| Q3 | 55 |
| 95-th percentile | 72 |
| Maximum | 72 |
| Range | 72 |
| Interquartile range (IQR) | 47 |
Descriptive statistics
| Standard deviation | 24.63672954 |
|---|---|
| Coefficient of variation (CV) | 0.7654771676 |
| Kurtosis | -1.383097498 |
| Mean | 32.1848 |
| Median Absolute Deviation (MAD) | 22 |
| Skewness | 0.2572738005 |
| Sum | 160924 |
| Variance | 606.9684426 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 436 | 8.7% |
| 72 | 269 | 5.4% |
| 2 | 180 | 3.6% |
| 3 | 141 | 2.8% |
| 4 | 130 | 2.6% |
| 71 | 128 | 2.6% |
| 7 | 102 | 2.0% |
| 5 | 97 | 1.9% |
| 8 | 94 | 1.9% |
| 12 | 88 | 1.8% |
| Other values (63) | 3335 |
| Value | Count | Frequency (%) |
| 0 | 8 | 0.2% |
| 1 | 436 | |
| 2 | 180 | |
| 3 | 141 | 2.8% |
| 4 | 130 | 2.6% |
| 5 | 97 | 1.9% |
| 6 | 65 | 1.3% |
| 7 | 102 | 2.0% |
| 8 | 94 | 1.9% |
| 9 | 81 | 1.6% |
| Value | Count | Frequency (%) |
| 72 | 269 | |
| 71 | 128 | |
| 70 | 75 | 1.5% |
| 69 | 67 | 1.3% |
| 68 | 71 | 1.4% |
| 67 | 65 | 1.3% |
| 66 | 66 | 1.3% |
| 65 | 55 | 1.1% |
| 64 | 58 | 1.2% |
| 63 | 53 | 1.1% |
| Distinct | 4 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 39.2 KiB |
| PG | |
|---|---|
| Graduation | |
| Intermediate | 27 |
| High School or less | 14 |
Length
| Max length | 19 |
|---|---|
| Median length | 2 |
| Mean length | 5.2696 |
| Min length | 2 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 ? |
| Distinct scripts | 0 ? |
| Distinct blocks | 0 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | High School or less |
|---|---|
| 2nd row | High School or less |
| 3rd row | High School or less |
| 4th row | High School or less |
| 5th row | High School or less |
Common Values
| Value | Count | Frequency (%) |
| PG | 2979 | |
| Graduation | 1980 | |
| Intermediate | 27 | 0.5% |
| High School or less | 14 | 0.3% |
Length
Pie chart
| Value | Count | Frequency (%) |
| pg | 2979 | |
| graduation | 1980 | |
| intermediate | 27 | 0.5% |
| high | 14 | 0.3% |
| school | 14 | 0.3% |
| or | 14 | 0.3% |
| less | 14 | 0.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| No values found. | ||
Most occurring categories
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per category
Most occurring scripts
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per script
Most occurring blocks
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per block
Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure that has been proposed by Bergsma in 2013 that can be found here.Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.First rows
| Gender | Business | Dependancies | Calls | Type | Billing | Rating | Age | Salary | Base_pay | Bonus | Unit_Price | Volume | openingbalance | closingbalance | low | Unit_Sales | Total_Sales | Months | Education | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | Female | No | No | Yes | Month-to-month | No | Yes | 18 | 5089.00000 | 2035.600000 | 254.450000 | 3.77 | 21226600 | 3.7500 | 3.760 | 3.65 | 18.25 | 18.80 | 0 | High School or less |
| 1 | Female | No | No | Yes | Month-to-month | No | Yes | 19 | 5698.12000 | 2279.248000 | 284.906000 | 3.74 | 10462800 | 3.8500 | 3.680 | 3.65 | 18.40 | 18.85 | 0 | High School or less |
| 2 | Male | No | No | Yes | Month-to-month | Yes | No | 22 | 5896.65000 | 2358.660000 | 294.832500 | 3.89 | 18761000 | 4.2300 | 4.290 | 3.72 | 18.70 | 18.90 | 0 | High School or less |
| 3 | Female | Yes | No | Yes | Month-to-month | Yes | Yes | 21 | 6125.12000 | 2450.048000 | 306.256000 | 4.35 | 66130600 | 4.2600 | 4.310 | 3.83 | 18.75 | 19.00 | 0 | High School or less |
| 4 | Male | No | No | Yes | Month-to-month | Yes | Yes | 23 | 6245.00000 | 2498.000000 | 312.250000 | 4.34 | 26868200 | 4.7900 | 4.410 | 4.08 | 18.80 | 19.05 | 1 | High School or less |
| 5 | Male | No | No | Yes | Two year | Yes | No | 23 | 6444.23000 | 2577.692000 | 322.211500 | 4.37 | 29869600 | 5.8800 | 5.040 | 4.13 | 18.80 | 19.10 | 1 | High School or less |
| 6 | Male | No | Yes | No | Two year | Yes | No | 23 | 6455.50000 | 2582.200000 | 322.775000 | 4.42 | 25239200 | 6.0925 | 5.590 | 4.15 | 18.80 | 19.10 | 1 | High School or less |
| 7 | Female | No | No | Yes | One year | Yes | No | 24 | 6458.35722 | 2583.342888 | 322.917861 | 4.44 | 28307500 | 6.1000 | 5.670 | 4.21 | 18.80 | 19.15 | 1 | Intermediate |
| 8 | Female | Yes | No | Yes | Month-to-month | Yes | Yes | 24 | 6529.23000 | 2611.692000 | 326.461500 | 4.45 | 24295600 | 6.1500 | 6.170 | 4.27 | 18.85 | 19.20 | 1 | Intermediate |
| 9 | Male | No | No | Yes | Month-to-month | Yes | No | 43 | 6682.33000 | 2672.932000 | 334.116500 | 4.41 | 17671600 | 6.2600 | 6.095 | 4.22 | 18.85 | 19.20 | 1 | Intermediate |
Last rows
| Gender | Business | Dependancies | Calls | Type | Billing | Rating | Age | Salary | Base_pay | Bonus | Unit_Price | Volume | openingbalance | closingbalance | low | Unit_Sales | Total_Sales | Months | Education | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 4990 | Male | No | No | Yes | Month-to-month | No | No | 70 | 168974.5280 | 61235.51239 | 8448.726400 | 312.500000 | 317200 | 33.119999 | 223.960007 | 307.399994 | 116.85 | 8672.45 | 72 | PG |
| 4991 | Male | Yes | No | Yes | Two year | No | No | 70 | 169149.7070 | 67659.88280 | 8457.485350 | 309.660004 | 443500 | 33.119999 | 219.080002 | 302.779999 | 117.15 | 8684.80 | 72 | PG |
| 4992 | Male | Yes | No | Yes | One year | No | No | 71 | 170372.5473 | 68149.01893 | 8518.627365 | 312.700012 | 295300 | 33.119999 | 238.089996 | 308.489990 | 117.20 | 1395.65 | 72 | PG |
| 4993 | Male | No | No | Yes | Month-to-month | Yes | Yes | 71 | 170639.5565 | 68255.82259 | 8531.977825 | 314.000000 | 294600 | 33.119999 | 237.899994 | 309.420013 | 117.45 | 1395.65 | 72 | PG |
| 4994 | Male | No | No | Yes | Month-to-month | Yes | No | 71 | 175689.3000 | 70275.72000 | 8784.465000 | 625.861078 | 7987100 | 33.119999 | 238.470001 | 302.048370 | 117.60 | 1395.65 | 72 | PG |
| 4995 | Female | No | No | Yes | Month-to-month | No | No | 72 | 180696.8000 | 72278.72000 | 9034.840000 | 629.511067 | 3927000 | 33.119999 | 293.838840 | 310.955001 | 117.80 | 1395.65 | 72 | PG |
| 4996 | Male | No | No | Yes | Month-to-month | Yes | No | 73 | 185685.9000 | 74274.36000 | 9284.295000 | 627.841071 | 6031900 | 33.119999 | 301.311314 | 309.610028 | 118.60 | 1395.65 | 72 | PG |
| 4997 | Male | No | No | Yes | Month-to-month | Yes | No | 74 | 192636.8000 | 77054.72000 | 9631.840000 | 625.860033 | 7949400 | 33.119999 | 306.040009 | 303.483494 | 118.60 | 1395.65 | 72 | PG |
| 4998 | Male | Yes | No | Yes | Month-to-month | Yes | Yes | 74 | 195970.7000 | 78388.28000 | 9798.535000 | 629.510005 | 3908400 | 33.119999 | 308.579987 | 312.432438 | 118.65 | 1395.65 | 72 | PG |
| 4999 | Male | No | Yes | Yes | Two year | Yes | No | 88 | 199970.7400 | 79988.29600 | 9998.537000 | 627.839984 | 6003300 | 33.119999 | 312.307316 | 311.081089 | 118.75 | 1395.65 | 72 | PG |